Declarative Languages for Querying Portal Catalogs
نویسندگان
چکیده
As data is increasingly captured, aggregated, and digitized worldwide, new types of information systems, such as digital libraries and information (subject) gateways, emerge as core technologies of the 21st-century economy. After a rst generation of systems focusing on the accessibility of available information resources, nowadays, high quality information collections are smoothly transformed into Community Web Portals. These Portals provide the means to select, classify and access, in a semantically meaningful and ubiquitous way, diverse information resources in order to develop and maintain speci c communities of interests (e.g., professional, trading, etc.) on corporate intranets or the Web. A key Portal component is the Knowledge Catalog holding descriptive information, i.e., metadata, about the community resources (e.g., sites, documents, data, etc.). Despite the current developments in standards for describing the content and meaning of information resources (see the W3C Metadata Activity), declarative languages suitable for querying both their semantic descriptions and the employed schemas are still missing. In this paper we present such a high-level query language for Portal Catalogs (e.g., as Open Directory, CNET, XMLNews) created according to the Resource Description Framework (RDF) standard [15, 4]. RDF [15] aims at facilitating the creation and exchange of metadata as any other Web data. RDF resource descriptions are represented as directed labeled graphs (where nodes are called resources or literals and edges are called properties) which can be serialized in XML. Furthermore, RDF schema [4] vocabularies are used to de ne the labels of nodes (called classes) and edges that can be used to describe and query resources in speci c communities. These labels can be organized into appropriate taxonomies, carrying the inclusion semantics of subjects/topics in a Portal Catalog. In this context, our query language, called RQL, relies on a graph data model allowing us to interpret semistructured RDF descriptions by means of one or more RDF schemas. Note that RDF schemas (a) do not impose a strict typing on the data (by e.g., permitting multiple classi cation, optional and repeated properties); (b) can be easily extended (e.g., through specialization of both classes and property types); (c) may provide only a partial or overlapped interpretation of the underlying data (e.g., by having several, eventually incomplete schemas for the same resource descriptions); and (d) are not entirely separated from the resource descriptions (i.e., they can be queried like normal data). Thus, RQL shares the exibility and utility of the recent proposals for semistructured or XML query languages, while, at the same time, extending their functionality to the RDF schema level by exploring in a transparent way the de ned taxonomies of classes and properties, as well as, the multiple classi cation of resources. To the best of our knowledge, RQL is the rst language to smoothly combine features from thesauri-based information retrieval systems (i.e., term expansion mechanisms [12]) with semistructured or XML query languages featuring variables on both property and class names (i.e., generalized path expressions [1]). Our work is motivated by the fact that existing semistructured models (e.g., OEM [18], YAT [8]) cannot capture the semantics of node and edge labels provided by RDF schemas (i.e., taxonomies of classes and property types), while semistructured or XML query languages (e.g., LOREL [2], UnQL [5], StruQL [11], XML-QL [10], XML-GL [7]) are not suited to exploit RDF schema information (i.e., pattern vs. semantic matching of labels). On the other hand, database (relational or object) schema query languages as SchemaSQL [14], XSQL [13] or Noodle [17] fail to fully accommodate RDFS features such as specialization of properties. Furthermore, they have been
منابع مشابه
Querying Community Web Portals
A new generation of information systems such as organizational memories, vertical aggregators, infomediaries, etc. is emerging nowadays. Such systems, termed Community Web Portals, intend to support speci c communities of interest (e.g., enterprise, professional, trading) on corporate intranets or the Web. More precisely, Portal Catalogs, organize and describe various information resources (e.g...
متن کاملQuerying RDF Descriptions for Community Web Portals1
Community Web Portals (e.g., digital libraries, vertical aggregators, infomediaries) have become quite popular nowadays in supporting specific communities of interest on corporate intranets or the Web. Portal Catalogs, organize and describe various information resources (e.g., sites, documents, data) for diverse target audiences (corporate, interenterprise, e-marketplace, etc.), in a multitude ...
متن کاملOn Personalizing the Catalogs of Web Portals
In this paper we propose a method for personalizing the catalogs of Web Portals. We propose SCSL, a declarative language for defining personal semantic channels over Web Portal catalogs. A semantic channel is actually a view of one or more Portal catalogs. SCSL offers powerful primitives for filtering and restructuring available thematic topics and classified resources. A user can connect to a ...
متن کاملNaviMoz: Mining Navigational Patterns in Portal Catalogs
Portal Catalogs is a popular means of searching for information on the Web. They provide querying and browsing capabilities on data organized in a hierarchy, on a category/subcategory basis. This paper presents mining techniques on user navigational patterns in the hierarchies of portal catalogs. Specifically, we study and implement navigation retrieval methods and clustering tasks based on nav...
متن کاملQuerying Evolving Graphs with Portal
Graphs are used to represent a plethora of phenomena, from the Web and social networks, to biological pathways, to semantic knowledge bases. Arguably the most interesting and important questions one can ask about graphs have to do with their evolution. Which Web pages are showing an increasing popularity trend? How does influence propagate in social networks? How does knowledge evolve? Much res...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000